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ABSTRACT 

A  nonparametric  estimator  of  the  survival  function  F  is  constructed 
from  time-sequential  progressively  censored  data  when  F  is  a  Dirichlet 
process.  The  estimator  is  shown  to  encompass  several  known  estimators 
of  F  including  the  product-limit  estimator  and  Ferguson's  estimator. 


1.  INTRODUCTION. 


The  subject  of  nonparametric  estimation  of  the  survival  function 
from  incomplete  or  censored  observations  has  received  much  attention 
for  more  than  two  decades.  We  may  cite  here  the  celebrated  work  of 
Kaplan  and  Meier  (1958)  where  a  product-limit  (PL-)  estimator  of  the 
survival  curve  is  obtained  from  a  sample  in  which  each  lifetime  may  be 
truncated  (fixed  censorship)  due  to  limits  on  observation.  In  Breslow 
and  Crowley  (1974)  the  properties  of  this  estimator  are  considered  in 
the  case  of  random  censorship,  where  each  lifetime  has  its  own  censoring 
random  variable,  and  the  lifetimes  and  censoring  times  being  each 
independent  and  identically  distributed  (iid)  sequences  and  also 
independent  of  each  other.  By  utilizing  the  notion  of  Dirichlet 
process  priors  introduced  by  Ferguson  (1973),  Susarla  and  Van  Ryzin 
(1976)  obtain  a  nonparametric  Bayesian  estimator  of  the  survival  function 
which  generalizes  the  PL-estimator  of  Kaplan  and  Meier. 

The  basic  formulation  in  these  works  involves  consideration  of 

a  random  sample  of  lifetimes  Xj . Xn  which  may  not  be  completely 

observable  due  to  the  existence  of  corresponding  censoring  variables 
Yl*"',Yn'  The  recorded  data  for  the  sample  is  therefore  (Zj,6j),..., 


(Zn,6n)  where  Z.  =  mir(  1  and  <5..  =  0  or  1  according  as 
X1  >  Yj  or  xt  <  Y^.  In  several  longitudinal  investigations  particularly 
those  related  to  clinical  trials  and  iatrogenic  follow-up  studies  the 
variables  {Z. :  1  <  i  <  n)  are  time-ordered:  <  Z <  •••  <  Z^nj. 

The  first  observation  is  the  smallest  one  followed  by  the  next 
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smallest  Z^j  and  s0  °n  until  the  largest  observation  Z^nj  is  recorded 

last.  In  these  circumstances  cost  and  time  limitations  often  preclude 

prolonged  experimentation  until  the  complete  set  of  data  {(Z^,6^): 

1  <  i  <  n}  has  been  recorded.  Furthermore  cogent  ethical  reasons  make 

it  imperative  that  observation  be  ceased  at  the  earliest  possible  stage 

if  the  current  accumulated  data  warrants  a  clear  statistical  decision. 

Thus  a  progressively  censored  scheme  may  be  advocated  in  which  observation 

is  curtailed  at  an  intermediate  stage  determined  by  the  cumulative 

statistical  information.  If  experimentation  is  terminated  at  the  k‘nth 

stage,  where  kR  €  {l,...,n}  may  be  a  stopping  time,  then  the  recorded 
*  * 

data  is  { (Z^  - ^ ,  6^ ):  1  <  i  <  > ,  with  6^=0  or  1  according  has 

Z^.j  is  a  censoring  time  or  a  true  lifetime. 

In  this  paper  we  construct  a  nonparametric  Bayesian  estimator, 

under  squared  error  loss,  for  the  survival  function  F  from  the  data 
* 

{(Z(ij,6.)-.  1  <  i  <  kn>  when  F  follows  a  Dirichlet  process  prior. 

Our  estimator  thus  generalizes,  to  the  progressively  censored  case,  the 

estimator  of  Susarla  and  Van  Ftyzin  (1976)  and  encompasses  both  fixed 

and  random  censorship.  It  includes,  of  course,  the  cases  in  which  the 

complete  sample  is  observed  (kn  =  n),  an  extension  of  an  estimator 

of  Ferguson  (1973)  when  no  censoring  is  present  and  the  Kaplan-Meier 

estimator.  It  should  be  noted  that  in  a  progressively  censored  scheme 

as  described  here  the  observed  duration  variables  (Z/.x:  1  <  i  <  k  } 

( U  -  n 

it 

and  their  corresponding  identifiers  {<$..:  1  <  i  <  kn>  are  neither 

independent  nor  identically  distributed.  The  absence  of  this  important 
technical  facility  in  the  case  of  progressive  censoring  (which  is  available 
when  the  complete  data  set  is  observed  as  in  the  works  cited  earlier) 
introduces  additional  complications  and  subtleties  in  the  analysis  of 
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progressively  censored  schemes.  For  some  applications  of  progressive 
censoring  see  Sen  et.al.  (1973,  1978,  1981). 

The  substantive  material  in  this  paper  in  distributed  in  the 
following  three  sections.  Section  2  introduces  the  basic  assumptions, 
notation  and  preliminary  notions  and  provides  a  brief  genesis  of  our 
estimator.  Various  special  cases  are  also  dealt  with  here.  We  have 
placed  the  laborious  technical  manipulations  of  construction  in  Section 
4,  while  Section  3  provides  a  numerical  example. 

2.  PRELIMINARIES. 

We  are  concerned  with  longitudinal  studies  in  which  n  specimens 
under  test  are  followed  from  the  onset  with  either  the  time  to  decrement 
(survival  time)  X  or  its  competing  censoring  time  Y  recorded  for 
each  unit  up  to  the  kth  response,  k  €  (l,...,n).  We  suppose  the 
survival  distribution  F  of  X  is  a  Dirichlet  process,  (for  the  de¬ 
finition  of  a  Dirichlet  process  and  other  terms,  see  Ferguson  (1973))  and  given 
F,  the  survival  times  Xj,...,X  of  the  sample  are  independent  and 
identically  distributed  (with  distribution  1-F).  Furthermore  we  consider 
the  corresponding  censoring  times  Yj,...,Yn  to  be  independent  of 
F,Xj,...,Xn,  but  make  no  further  assumptions  on  the  distribution  of  the 
Y.-'s  themselves. 

The  objective  is  to  estimate  the  survival  curve 

F(t)  =  PCX  >  t|FD,  t  >  0  (2.1) 

We  do  not  have  at  our  disposal  the  complete  set  of  data  {(Z^,5^): 

1  <  i  <  n}  where 
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Z.  =  min(X.,Y.) 

6..  =  0  or  1  according  as  >  Y^  or  X^  <  Y^, 


but  rather  the  first  k  order  statistics  (Z 


(I)’* 


’Z(k)}  from 


{Zj,...,Zn>  and  their  corresponding  identifiers  {6j . 5^},  where 

* 

6^  =  1  or  0  according  as  Z^.j  is  a  survival  time  or  censoring  time. 
On  the  basis  of  these  recorded  data  {(Z^,6.):  1  <  i  <  k}  we  seek 

the  Bayes  estimator  F(t)  of  F(t)  under  the  loss  i 


*(F,F)  =  /  (F(x)  -  F(x)J  dw(x) 
0 


(2.2) 


where  w  is  a  weight  function.  Thus  F(t)  is  simply  the  posterior 
conditional  expectation  of  F(t)  given  1  <  i  <  k},  that 

■ft 

Is,  we  need  to  evaluate  E(F(t)|(Z^\,6^):  1  <  i  <  k),  where  E  denotes 

expecation  with  respect  to  the  Oirichlet  process  with  parameter  a. 

As  argued  in  Susarla  and  Van  Ryzin  (1976)  this  may  be  accomplished  in 

two  stages.  First  relabel  the  data  {(Z^. )»$*):  1  <  i  <  k}  as  follows: 

*  *  *  * 

let  Z^jj,...,Z^j  and  Z^+1j . z(k)  denote  respectively  the  ordered 

survival  times  and  ordered  censoring  times  recorded  among  Z^ . Z(k)' 

Now  consider  a  random  sample  of  size  z,  say  from  a  Dirichlet 

process  £  with  parameter  o  and  then  a  random  sample  of  size  (k-z) 
say  n4+j»-  •  •  from  the  conditional  process  of  £  given 
Then  this  conditional  process  is  itself  a  Dirichlet  process  with  parameter 
6,  with  g  given  by 

P( * )  =  a( • )  +  l  I,  ,(*)• 
i=l  {ni} 
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Therefore  if  the  conditional  process  of  F  given 
is  a  Dirichlet  process  with  parameter 

3  =  “  +  I{Z^i)}»  (2,3) 

then  the  construction  of  our  estimator  F(t)  reduces  to  the  evaluation 
of  E(F(t)|(Z^j,0):  i  <  i  <  k)  where  E  now  denotes  expectation  with 
respect  to  the  distribution  of  the  Dirichlet  process  with  parameter  e 
of  (2.3).  This  will  be  shown  to  reduce  to 


E(F(t)<  H  F(2(.,)HF(2,k,)}"'k) 
F(t)  -  — -  (-' - - - 

E({"  F(Z(1)»(F(Z(k)»n'k) 


(2.4) 


We  shall  defer  the  details  of  the  evaluation  of  (2.4)  to  Section  4. 
The  final  form  of  F(t)  can  be  written  as 


F(t)  =  B(t)W(t) 


(2.5) 


where 


a(t,»)  +  N+(t)  +  (n-k)Ct  <  Z,. 

B(t)  =  - T 

o(F  )  +  n 


+  -n C Z / . %  <  t,6 .  =  OD/x. 

m  -  *  j»(z(J)->  -  -  um)  -  <"-*>  -  $  (J)  -  J  J 


j=l  \  a(z(j)»“)  +  N+(Z/ . % )  +  (n-k) 


U)‘ 


a(2(k)>-)  ♦  (n-k)f Z<k)  J  " 


and  CA]  denotes  the  indicator  of  a  set  A, 
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Nt(t)  ’  j l,  CZU)  *  «• 

a(F+  )  =  a(0,®). 


and  Aj  =  number  of  censored  observations  tied  at  Z^j,  and  q  in 
an  exponent  interpreted  as  unity.  It  is  easy  to  see  that  F  is  left 
continuous  at  censored  observations  provided  the  measure  a  has  no 
atoms  at  these  points. 

Several  special  cases  follow  from  (2.5). 

(a)  Suppose  the  entire  data  set  {(Z-»6-):  1  <  i  <  n}  is  available. 

Then  setting  k  =  n  throughout  we  obtain 


F(t)  }  n 

«(F+)  +  n  j=l  1  a(Z.,»)  +  N  (Z.) 


f  x.  -»CZ.  <t»6.  -  03/ A- 

n  f-U,..)  +  N  (zp  +  J  J 


(2.6) 


which  is  the  estimator  given  by  Susarla  and  Van  Ryzin  (1976).  It  is  also 
shown  there  that  in  the  limit  ct(F  +  )  0,  (2.6)  reduces  to  the  Kaplan- 

Meier  product-limit  estimator.  If,  however,  we  have  only  the  partial 
data  set  {(Z^j,6.):  1  <  i  <  k)  of  a  progressively  censored  sample, 

then  for  t  <  Z^j,  the  limit  of  (2.5)  as  a(R+)  -*-0  is 


k  N  (Zm)  +  (n-k)  +  A. 
n  {  w - i. 

j=l  N  (Z(j))  +  (n-k) 


CZ(j)  -  t,5o  =  °VAj 


(2.7) 


Now  writing 


N  (Z(j))  +  (n-k) 


{j:Z,.x<t}|N+(Z,<J  +  (n-k)  +  A. 


where 


Aj  is  the  multiplicity  of  Zyj,  we  find  that  (2.7)  reduces  to 

r  +  *nCZ/.v,  <  ^,6.  =  13 

k  JN  (Z/,x)  +  (n-k)  j  (J)  -  J 


j=l\N+(Z(j))  +  (n"k)  +  xj 


(2.8) 
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If  there  are  no  ties  among  the  uncensored  observations  this  is  precisely 


the  product-limit  estimator  for  t  <  Z^j.  When  t  >  the  behavior 

A  A 

of  F  depends  on  a  even  in  the  limit  and  we  cannot  recover  F  since 
for  any  M  >  0  one  can  choose  measures  which  agree  on  (0,M] 

but  differ  on  (M,«). 

(b)  Suppose  that  in  addition  to  the  entire  data  set  {(Z^.S^):  1  <  i  <  n} 

being  available  there  is  no  censoring  present.  Then  setting  k  =  n 
* 

and  5  t,6^  =  0]  *  0  in  the  terms  following  (2.5)  we  obtain 


p(t)  =  a(t,~)  +  N*(t) 
a(R+)  +  n 


(2.9) 


which  is  the  estimator  of  F (t )  proposed  by  Ferguson  (1973).  Again 
in  the  limit  as  a(R+)  ->  0,  (2.9)  reduces  to 


F(t)  =4^  *  "-1  j,  >  « 


(2.10) 


which  is  the  empirical  survival  function  of  Xj,...,X  . 

On  the  other  hand  if  under  a  progressively  censored  scheme  the 
only  available  data  is  {(Z^j.fi.):  1  <  i  <  k},  then  in  the  absence  of 

observed  censoring  times  among  Z^jj,...,Z^  reduces  (2.5)  to 

fa(t,»)  +  N+(t)  +  (n-k)Ct  <  Z(kj[  f*(Z,n,»)  +  (n-k)Y  (k)  '  t] 


F(t)  = 


a(F+)  +  n 


(k) 

a(Z(k)’“>) 


(2.11) 


We  may  thus  regard  (2.11)  as  the  appropriate  generalization  of  the 
Ferguson  estimator  (2.9)  to  the  progressively  censored  case.  Observe 
that  if  t  <  the  limit  of  (2.11)  as  a(R+)  +0  is  again  the 

empirical  survival  function  (2.10).  For  t  >  Zjkj  this  limit  will 
depend  on  a  and  our  previous  remark  in  (a)  applies. 
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With  the  restrictions  noted  here  we  depict  the  interrelation  among  the 
various  estimators  of  F ( t )  diagrammatical ly  as  follows: 


Our  estimator  (2.5) 


3.  A  NUMERICAL  EXAMPLE 

We  illustrate  here  the  power  of  a  progressively  censored  scheme 
with  the  partial  data  set  { (Z^  ^ , 6^ ) ;  1  <  i  <  k}  (k  <  n),  to  yield 
results  that  are  almost  in  agreement  with  those  obtained  when  the  complete 
survival  profiles  {(2^6^):  1  <  i  <  n}  of  the  sample  have  been  recorded. 

The  data,  taken  from  Johnson  and  Elandt-Johnson  (1980)  (page  179)  represents 
the  survival  times  in  weeks  of  81  patients  in  a  melanoma  study  conducted 
through  the  Central  Oncology  Group  at  the  University  of  Wisconsin,  Madison. 


9 


The  censored  survival  times  are  indicated  by  a  +  sign. 


136, 

58, 

55+, 

18 1  + » 

21, 

23, 

190+ , 

65, 

234 

194+ , 

14, 

90, 

20, 

130, 

213+, 

215  + , 

124, 

108  + 

54. 

98, 

193  +  , 

138, 

141, 

no. 

67  + , 

50, 

26 

103, 

59, 

134  +  , 

147 +  , 

152+ , 

65, 

40, 

34, 

57 

81  +  . 

152+, 

125  +  , 

1 5 1  + , 

34, 

158  + , 

27, 

148 +  . 

27 

132+. 

140+ , 

32, 

130+ , 

38, 

85, 

129+ , 

100  +  , 

19 

118, 

53, 

120+ , 

66, 

46, 

37, 

50+ , 

114+, 

124+ 

26, 

102, 

93 +  , 

80+, 

60, 

86  +  , 

21  +  , 

44+, 

23 

70, 

73  +  , 

19, 

38, 

31, 

25, 

76  + , 

13, 

16  + 

We  choose 

for  our  parameter  a 

the  measure  generated  through 

a(t,“)  =  exp(-et),  t  >  0  where  e  >  0  is  a  real  parameter.  Since  from 
(2.1)  E (F (t ) )  =  a(t,«)/a(R+)  =  e-0t  (expectation  with  respect  to  the 
Dirichlet  process  F),  we  estimate  e  by 


A  reason  for  this  is  that  when  the  censoring  times  {Y^ :  1  <  i  <  n} 

are  i id  and  the  survival  times  are  iid  with  survival  distribution 


F0(t) 

=  exp(-et) 

then 

{9t  :  n 
Kn 

>  1}  converges  in 

probability  to 

e 

when 

n'\  - 1 

as  n 

■*  ®.  We  have  computed  F(t) 

in  three 

cases: 

1)  k 

=  n  =  81; 

2)  k  = 

73  and 

3)  k  = 

65.  The  agreement  between 

the 

three 

curves  is 

very  good  for  time  points 

"  Z(k>- 

t  = 

25 

44 

54 

65 

76 

100 

148 

190 

1) 

.89779  . 

74449 

.69184 

.61091 

.59691 

.54368  . 

35401 

.31245 

2) 

.89736  . 

74382 

.69107 

.61005 

.59597 

.53361  . 

35259 

.24209 

3) 

.89691  . 

74317 

.69034 

.60925 

.59512 

.53267  . 

34307 

.22655 
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4.  PROOFS. 


Recall  the  notation  introduced  in  Section  2.  The  proof  that 
the  conditional  process  of  F  given  {(Z^j,l):  1  <  i  <  n}  is  a 

Dirichlet  process  with  parameter  3  as  specified  in  (2.3),  follows  along 
exactly  the  same  lines  as  that  of  Theorem  4  in  Susarla  and  Van  Ryzin 
(1976).  In  order  to  demonstrate  (2.4)  write 

*  1  * 

E(F(t)|(Z(i),0):  A  <  i  <  k)  =  /  P[F(t)  >  a|(Z(i),0):  A  <  i  <  k]da  (4.1) 


where  E  denotes  (and  in  the  sequel)  expectation  with  respect  to  the 

*  ★ 


Dirichlet  process  with  parameter 
a  Y%  Y.  so  that  X,.  > 

V  yj  >  z(kj  where  z<j)  =  xj  A 

may  be  written  as  the  ratio  I^/I^ 


3.  Now  for  a  <  i  <  k,  6^  =  0,  Z^  j  = 

and  for  k  <  j  <  n  of  course 
★ 

Y..  Therefore  the  integrand  in  (4.1) 

J 

where 


★  * 


11  =  E(P[F(t)  >  a,  X.  €  (Z(i),~),  A  <  i  <  k,  X..,  Yj  6  (Z(k),»),  k  <  j  <  n 


F(t),F(Z(k)),  F(Z(i}),  A  <  i  <  kJ)  (4.2) 


and  I2  is  the  resulting  expectation  obtained  by  suppressing  both  F(t)  and 
"F(t)  >  a"  in  (4.2).  Since  (Yj,...,Yn)  is  independent  of  (F,Xj,... ,Xn) 

•je  -fa  <j^ 

and  including  the  terms  Z^.j  s  X.  e  (0,°°)  (when  6^  =  1),  1  <  i  <  a 
we  get  on  simplification 


Ij  -  E(CF(t)  >  a]. 
xj e  (z(k 


PCX^  6  (0,»),  1  <  i  <  A,  Xj  €  (Z^.j,*),  a  <  i  <  k, 
k  <  j  <  n|F(t),F(Z(k)),  F(zJ.)),  A  <  i  <  k]) 

PC Y j  €  (Z(k),»),  k  <  j  <  n] 


(4.3) 
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Likewise  Ig  is  obtained  from  (4.3)  by  suppressing  [F(t)  >  a]  and 
F(t).  Since  {Xj,...,Xn>  is  a  random  sample  from  F  the  inner  conditional 
probability  in  (4.3)  is  almost  surely 


sLi  F(z(i))HF(z(k)))"'k 

Finally  using  (4.4)  in  I^/Ig  and  carrying  out  the  integration  in  (4.1) 
we  obtain  (2.4). 

We  are  now  left  with  the  tedious  task  of  carrying  out  the  integrations 

in  (2.4).  Several  cases  must  be  considered  depending  on  the  position 

*  * 


of  the  time  point  t  among  the  observed  points  Z^+1j , . . .  and 

Z^kj  (=  max{Z^.j:  1  <  i  <  k}). 

Suppose  Z^+jj,...,Z^mj  denote  the  distinct  ordered  values 


among  ^(£+i)*****^(|() 
Thus 


with  corresponding  multiplicities  x 


4+1 


X.  >1,  £<i<m;l<£<k<n  and  £  <  m  <  k,  £  x.  =  k-£. 

1  "  -  -  -  -  .  i=£+1  i 

The  largest  recorded  observable  Z^  may  be  either  a  survival  time  or 
a  censoring  time.  Suppose  we  are  in  the  latter  case  so  that  Z^  =  z|mj. 
Consider  the  case  t  >  Select  the  partition  of  R+  =  (0,°°) 

given  by  the  points  {z|^:  £  <  i  <  m  +  2}  where  Z+^^  =  O.Z^jj  =  t 
and  2(^2)  =  00 ■  Then  defining  =  F(Z^.j)-  F(Z^+Jj),  £  <  i  <  m  +  1, 
the  random  vector  (U^ , . . -  •W  has  the  Dirichlet  distribution  with 
parameter  (e^, . . .  where 


Bi  =  B(Z(1),Z(i+l)] 


(4.1) 


and  6  as  given  in  (2.3). 
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+  i_1 

Now  F(Zrx)  =  (1-  l  IL),  i  <  1  <  m  +  1.  Therefore  the  integrand  in  the 
j=£  J  m  "  .  x.  .  k 

numerator  of  (2.4)  —  F(t){  n  (F(Z*.%))  1 >{F(Z#  %)  }  can  be  written 


i=£+l 

itH-1  i-1  x.  m-1  _  k 

H  (1-  l  uj’d-z  U.)  . 

i=£+l  j=£  J  j=£  J 


(4.2) 


with  x^j  =  1.  For  the  denominator  in  (2.4)  the  integrand  is  the  same 


as  (4.2)  except  that  x^j  =  0. 


Now  (UA,...,Um)  has  probability  density 


nH-1 

V  m  e.-l  m  e^.,-1 

-  H  n  u.J  }(1-  l  u.)  mL  ,  0  <  all  u.  <  1  and 

n n-l  •_  0  j  *_  Q  J  J 

n  r(B.)  J  1  3~l 

i=£  1  m 

0  <  l  u  ■  <  1 » 
j=£  J 

where  r  denotes  the  Gamma  function.  The  expectation  of  (4.2)  involves 

integration  over  the  variables  u£,...,um.  Suppressing  all  terms  not 

involving  u  ,  the  integral  over  u  is 
m  m 


m-1 

(1-1  u.) 

j=A  6m_1  m_1  (emfi+Xmfl)-1 

/  n.  (1  -  l  u.-uj  ",fl  nH1  du„ 

0  m  fit  3  m  " 


_  r(em>r(WW  m-J  ,  .VWW1 


Now  proceeding  with  the  successive  integrations  over  um_j» • • • »u£  we 
finally  obtain  for  the  numerator  of  (2.4) 


III*  A  III*  X 

r(  I  3,)  fr(g  )rfe  +x  m-1  r^.-)r(  l  ( B^x- )+ (n-k) ) 

i=£  1  \  )  nVngnH-rVlJ  (  m  l  1  j=i+l  J  3 _ 

m+1  /  \  r(e +6m+1+x_. , )  (  \  nn-1 

n  r(8.)  I  m  nH-1  1  J  i-i  r( p  +  j  (e.+x.)+(n-k)) 

i=£  1  )  l  1  j=i+l  3  3 


r^Bm+BnH-l+*nH-l^ 


(4.3) 


V 
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Note  that  in  (4.3)  the  value  of  a^j  is  1.  For  the  denominator  of 
(2.4)  we  have  the  same  expression  except  that  a  j  =  0. 

Proceeding  with  cancellations  of  the  common  factors  in  the  numerator  and 
denominator  of  (2.4)  yields  our  estimator 


m 


»  ^nri-1  ^ 

F(t)  =  (F75— ) 1  » 

Bm  BnH-l  j=£+l 


<Bi+xi>*W<"-k> 

_lzJ _ _ _ 


m 


J.  («1n1)*W<"-k,+f>M 

V.  >“J  j 


(4.4) 


Recall  (4.1)  and  (2.3).  A  trite  computation  shows 


em+1  =  a(t,°°)  +  #  (observed  lifetimes  >  t) 


Bm+BnH-l  =  a^Z(m)’”^  +  #  (°bserved  lifetimes  >  l+^) 

m  +  + 

J  (Vxi)+Vl  =  a^Z(j)’°°^  +  #  (observed  lifetimes  >  Z^j) 

1  "*J 

+  #  (observed  censoring  times  >  Z^.j) 

■  “(z(j)-”>  +  N*<z(j)>  +>j 


i  <6i+xi>+vi+6j.i  ■  + n 

Substituting  in  (4.4)  and  again  cancelling  out  common  factors  we  get 
for  the  product  term  in  (4.4) 

a(ZW“)+NMm))+(n-k)nm  J  f'«(Z(j)^H+(z|.)H(n-k)n. 

-  /  7*^  /  *7*^’  /-  I.  \  .  .  1  1  /  •7*^’  \  .  11^  /*7^’  \  .  /_  I.  \ 


(4.5) 


°^Z(£)’”)+N  ^Z(£)^+^n“k^ 


j=£+l  \a(Z(j),«)+NT(Z(j))+(n-k) 


*(4.6) 


Now  Z^mj  =  and  Z^j  =  0.  Also  in  the  case  considered  here 

N+(Z^m))  =  0,  N+(t)  =  0.  Using  these  in  (4.5)  and  (4.6)  yields 
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which  is  the  form  of (2.5)  for  this  case. 

All  other  cases  are  handled  in  exactly  the  same  manner  and  lead 
to  the  general  form  of  F(t)  given  in  (2.5). 

5:  CONCLUDING  REMARKS 

It  can  be  shown  that  when  the  censoring  times  {Y^ :  i  >  1} 
are  iid  with  continuous  right  distribution  function  G  on  (0,-=°),  the 
survival  times  {X^:  i  >  1}  are  i id  with  continuous  right  distribution 
F,  and  n-1kn  ->  y  €  (0,11,  then  for  any  T  >  0  with  F(T)G(T)  >  1-y, 
the  process  {n2(F(t)  -  F ( t ) ) :  t  €  C0,T}  converges  weakly  to  a  Gaussian 
process.  Furthermore  under  appropriate  conditions  strong  convergence 
and  consistency  can  be  demonstrated. 
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